Active subclustering

Authors

  • Arijit Biswas
  • David W. Jacobs
Abstract

Although there are many excellent clustering algorithms, effective clustering remains very challenging for large datasets that contain many classes. Image clustering presents further problems because automatically computed image distances are often noisy. We address these challenges in two ways. First, we propose a new algorithm to cluster a subset of the images only (we call this subclustering), which will produce a few examples from each class. Subclustering will produce smaller but purer clusters. Then we make use of human input in an active subclustering algorithm to further improve results. We run experiments on a face image dataset and a leaf image dataset and show that our proposed algorithms perform better than baseline methods.

Similar resources

Semi-supervised and Active Image Clustering with Pairwise Constraints from Humans

Title of dissertation: Semi-supervised and Active Image Clustering with Pairwise Constraints from Humans. Arijit Biswas, Doctor of Philosophy, 2014. Dissertation directed by: Prof. David W. Jacobs, Department of Computer Science, University of Maryland, College Park. Clustering images has been an interesting problem for computer vision and machine learning researchers for many years. However as the ...

Star Formation in Clusters: Subclustering, Cloud Fragmentation and the Origin of the Stellar IMF

We review recent high spatial resolution millimeter continuum and spectral line observations of (proto-)cluster regions. These observations reveal that the mass distribution of prestellar cores is consistent with the initial mass function for field stars, suggesting that the IMF is connected to the molecular cloud structure or the cloud fragmentation processes, rather than the details of the st...

Human Bocavirus in Patients with Encephalitis, Sri Lanka, 2009–2010

We identified human bocavirus (HBoV) DNA by PCR in cerebrospinal fluid from adults and children with encephalitis in Sri Lanka. HBoV types 1, 2, and 3 were identified among these cases. Phylogenetic analysis of HBoV1 strain sequences found no subclustering with strains previously identified among encephalitis cases in Bangladesh.

The build-up of the Coma cluster by infalling substructures

We present a new multiwavelength analysis of the Coma cluster subclustering based on recent X-ray data and on a compilation of nearly 900 redshifts. We characterize subclustering using the Serna & Gerbal (1996) hierarchical method which makes use of galaxy positions, redshifts, and magnitudes, and identify 17 groups. One of these groups corresponds to the main cluster, one is the well known gro...

Sub-clustering in decomposable graphs and size-varying junction trees

Abstract: This paper proposes a novel representation of decomposable graphs based on semi-latent tree-dependent bipartite graphs. The novel representation has two main benefits. First, it enables a form of subclustering within maximal cliques of the graph, adding informational richness to the general use of decomposable graphs that could be harnessed in applications with behavioural type of dat...

Journal:
  • Computer Vision and Image Understanding

Volume 125

Published 2014